Skip to content

new module: bio2zarr/vcfpartition#11098

Merged
camlloyd merged 10 commits intonf-core:masterfrom
camlloyd:vcfpartition
Apr 15, 2026
Merged

new module: bio2zarr/vcfpartition#11098
camlloyd merged 10 commits intonf-core:masterfrom
camlloyd:vcfpartition

Conversation

@camlloyd
Copy link
Copy Markdown
Member

@camlloyd camlloyd commented Mar 31, 2026

PR checklist

Closes #11097 by adding new module: bio2zarr/vcfpartition

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool - have you followed the module conventions in the contribution docs
  • If necessary, include test data in your PR.
  • Remove all TODO statements.
  • Broadcast software version numbers to topic: versions - See version_topics
  • Follow the naming conventions.
  • Follow the parameters requirements.
  • Follow the input/output options guidelines.
  • Add a resource label
  • Use BioConda and BioContainers if possible to fulfil software requirements.
  • Ensure that the test works with either Docker / Singularity. Conda CI tests can be quite flaky:
    • For modules:
      • nf-core modules test <MODULE> --profile docker
      • nf-core modules test <MODULE> --profile singularity
      • nf-core modules test <MODULE> --profile conda
    • For subworkflows:
      • nf-core subworkflows test <SUBWORKFLOW> --profile docker
      • nf-core subworkflows test <SUBWORKFLOW> --profile singularity
      • nf-core subworkflows test <SUBWORKFLOW> --profile conda

@camlloyd camlloyd marked this pull request as draft March 31, 2026 17:28
@camlloyd camlloyd marked this pull request as ready for review March 31, 2026 17:42
Copy link
Copy Markdown
Contributor

@Joon-Klaps Joon-Klaps left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice addition! Minor suggestions.
Wondering if this is actually not a subcommand of a larger tool bio2zarr, which would make the name of the tool modules/nf-core/bio2zarr/vcfpartition, thoughts?

Comment thread modules/nf-core/bio2zarr/vcfpartition/tests/main.nf.test
Comment thread modules/nf-core/bio2zarr/vcfpartition/tests/main.nf.test
Comment thread modules/nf-core/bio2zarr/vcfpartition/tests/main.nf.test
Comment thread modules/nf-core/bio2zarr/vcfpartition/tests/main.nf.test
Comment thread modules/nf-core/vcfpartition/main.nf Outdated
@camlloyd
Copy link
Copy Markdown
Member Author

camlloyd commented Apr 1, 2026

Nice addition! Minor suggestions. Wondering if this is actually not a subcommand of a larger tool bio2zarr, which would make the name of the tool modules/nf-core/bio2zarr/vcfpartition, thoughts?

Good question. I think one argument is that bio2zarr is the repository/package name, and vcfpartition exists at the tool level. The package also contains vcf2zarr, which has its own subtools convert, explode etc.

I previously added the modules vcf2zarr/convert and vcf2zarr/explode.

I didn't consider the other tools then, but now I want to work on plink2zarr, so I'm happy to re-think this approach.

@camlloyd
Copy link
Copy Markdown
Member Author

camlloyd commented Apr 1, 2026

Or, more simply, there is no bio2zarr command.

@jfy133
Copy link
Copy Markdown
Member

jfy133 commented Apr 1, 2026

Nice addition! Minor suggestions. Wondering if this is actually not a subcommand of a larger tool bio2zarr, which would make the name of the tool modules/nf-core/bio2zarr/vcfpartition, thoughts?

Good question. I think one argument is that bio2zarr is the repository/package name, and vcfpartition exists at the tool level. The package also contains vcf2zarr, which has its own subtools convert, explode etc.

Are there other commands within the bio2zarr package/repository? If so all would go under bio2zarr/<command name> in my opinion.

But even then, if vcfpartition is the sole tool within the repository name, it's a very generic name 😅 so prefixing with bio2zarr would be nicer.

I previously added the modules vcf2zarr/convert and vcf2zarr/explode.

I didn't consider the other tools then, but now I want to work on plink2zarr, so I'm happy to re-think this approach.

I see why you would opt for that here though... it's a bit nasty from the developers...

When did you add the previous modules? Do you think they are already being used in pipelines, or are they fresh modules that we could rename them (e.g. to `bio2zarr/vcf2zarrexplode)?

Another annoying example i've encoutnered is the conda recipe being called ncbi-datasets but the actual executed command is just datasets (very generic)

But let's also see what other maintainers think... ultimately it's about findabilty at this poitn

@camlloyd
Copy link
Copy Markdown
Member Author

camlloyd commented Apr 1, 2026

One yesterday, one last year. Neither used in pipelines as far as I can tell.

bio2zarr contains:

  • vcf2zarr
  • plink2zarr
  • tskit2zarr
  • vcfpartition

@camlloyd camlloyd changed the title new module: vcfpartition new module: bio2zarr/vcfpartition Apr 15, 2026
@camlloyd camlloyd requested a review from Joon-Klaps April 15, 2026 10:38
Copy link
Copy Markdown
Contributor

@Joon-Klaps Joon-Klaps left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for being a nitpicky on the name but I think it will be easier now for everyone to use this

@camlloyd
Copy link
Copy Markdown
Member Author

I will update the other modules in a follow-up PR. Thank you!

@camlloyd camlloyd added this pull request to the merge queue Apr 15, 2026
Merged via the queue into nf-core:master with commit 38ffe77 Apr 15, 2026
28 checks passed
@camlloyd camlloyd deleted the vcfpartition branch April 15, 2026 10:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

new module: bio2zarr/vcfpartition

3 participants